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Abstract 

We study local search algorithms for metric instances of facility location problems: the uncapacitated 
facility location problem (UFL), as well as uncapacitated versions of the fc- median, fc-center and k- 
means problems. All these problems admit natural local search heuristics: for example, in the UFL 
problem the natural moves are to open a new facility, close an existing facility, and to swap a closed 
facility for an open one; in fc-medians, we are allowed only swap moves. The local-search algorithm 
for fc-median was analyzed by Arya ct al. {SIAM J. Comput. 33(3):544-562, 2004), who used a clever 
"coupling" argument to show that local optima had cost at most constant times the global optimum. 
They also used this argument to show that the local search algorithm for UFL was 3-approximation; 
their techniques have since been applied to other facility location problems. 

In this paper, we give a proof of the fc-median result which avoids this coupling argument. These 
arguments can be used in other settings where the Arya et al. arguments have been used. We also 
show that for the problem of opening k facilities F to minimize the objective function ^p{F) = 

(Sjev ^0'-^)^)^^^' the natural swap-based local-search algorithm is a 9(p)-approximation. This 
implies constant-factor approximations for fc-medians (when p = 1), and fc-means (when p = 2), and 
an 0(logn)-approximation algorithm for the fc-center problem (which is essentially p = logn). 

1 Introduction 

Facility location problems have been central objects of study in the operations research and computer 
science community, not only for their intrinsic fundamental nature and broad applicability, but also as 
problems whose solutions have led to the development of new ideas and techniques. Indeed, techniques 
such as rounding linear relaxations, using primal-dual techniques and Lagrangean relaxations, greedy 
algorithms (with and without the dual- fitting approach), and local search algorithms have all been 
honed when applied to facility location problems. 

Local search has been a popular algorithm design paradigm, and has had many successes in the design of 
approximation algorithms for hard combinatorial optimization problems. The focus of this paper is on 
local search algorithms for facility location problems on metric spaces. In fact, the best approximation 
algorithm known for the /c-median problem is a (3 -|- e)-approximation via local search [AGK"'"04]. How- 



ever, the analysis of this simple local search algorithm is fairly subtle, and requires a careful "coupling" 
argument. The same coupling argument was used by [ A.GK"'"04 ] to also analyze a local search algorithm 



for uncapacitated facility location, and subsequently by |DGK"'"05, Pan04] for some other problems. 
In this paper, we present the following: 
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We give somewhat simpler analyses for the natural local search algorithms of [ AGK"'"04, DGK+05, 
Pan04 1 : while the approximation guarantees remain the same, the proofs are arguably more 
intuitive than existing proofs. 



• We show that the problem of opening k facilities F to minimize the objective function 




the natural swap-based local-search algorithm is a 0(p)-approximationQ: this immediately implies 
constant factor approximations for A;-medians (which corresponds to the case /c = 1), /c- means (the 
case p = 2), an 0(logn)-approximation local search algorithm for the /c-center problem (which is 
essentially the case p = logn). To the best of our knowledge, the results for p 7^ 1 give the first 
analyses of local search heuristics for these problems. 

Technical Ideas. The main contribution of this work is the simplification of the proofs. As with most 
local search proofs, the previous papers considered a local optimum, and showed that since a carefully 
chosen set of local moves were non-improving, we could infer some relationship between our cost and 
the optimal cost. However, in the previous papers, this set of local moves have to be carefully defined 
by looking at how the clients served by each optimal facility were split up between facilities in the local 
optimum. Even bounding the change in cost due to each of these potential local moves is somewhat 
non-trivial. 

In our paper, we define a set of local moves based only on distance information about which of the 
optimal facilities F* are close to which of our facilities F. The intuition is simple: consider each of the 
optimal facilities in F* , and look at the closest facility to it in If some facility / G F is the closest 
to only one facility /* F* , then we should try swapping / with /*. However, if there is some facility 
f & F that is the closest facility to many facilities in F*, then swapping / might be bad for our solution, 
and hence we do not want to close this facility in any potential move. Formalizing this natural intuition 
gives us the claimed simpler proofs for many problems. 

1.1 A Note on the Rate of Convergence. 

In this paper, we will only focus on the quality of the local optima, and not explicitly deal with the 
rate of convergence. This, however, is only for brevity: since all our arguments are based on averaging 
arguments, we can show that if at any point in the local search procedure, the current solution is 
very far from every local optimum, then there is a step that reduces the objective function by a large 
amount. Moreover, since we are dealing with approximation algorithms, we can stop when each local 
step improves the objective function by at most a factor of (1 -|- e), which would give us only a slightly 
worse approximation guarantee but would keep the running time polynomial in n and e~^. These details 
are fairly standard; e.g., see the discussion in 

1.2 Other Related Work 



AGK^04|. 



Facility location problems have had a long history; here, we mention only some of the results for these 
problems. We focus on metric instances of these problems: the non-metric cases are usually much 
harder [lHoc82|1 . 

^When we say that our local search algorithm is a p-approximation, we mean that the cost of every local optimum is 
at most p times the opt imum cost. In this paper, we also implicitly mean that one can find a solution of cost (p + e)OPT 
in time poly(n,e) — see Section 1.2 for details. 
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A:-Median. The A;-median problem seeks to find facilities F with \F\ = k to minimize ^j^y d{j, F)'^ . 
The first constant factor approximation for the k-median problem was given by Charikar et al. iCGTS02i , 
which was subsequently improved by [CG05| and | AGK+04] to the current best factor of 3 + e. It is 
known that the natural LP relaxation for the problem has an integrality gap of 3, but the currently- 



known algorithm that achieves this does not run in polynomial time [A.RS03|. The extension of fc-median 
to the case when one can open at most k facilities, but also has to pay their facility opening cost was 
studied by [ pGK+05|| , who gave a 5-approximation. 

A;-Means. The fc-means problem minimizes X^jgy d{j, F)'^ , and is widely used for clustering in machine 
learning, especially when the point set is in the Euclidean space. For Euclidean instances, one can obtain 
(1 + e)-approximations in linear time, if one imagines k and e to be constants: see [KSS04] and the 
references therein. The most commonly used algorithm in practice is Lloyd's algorithm, which is a local- 
search procedure different from ours, and which is a special case of the EM algorithm |Llo82]. While 
there is no explicit mention of an approximation algorithm with provable guarantees for fc-means (to 
the best of our knowledge), many of the constant-factor approximations for /c- median can be extended 
to the /c-means problem as well. The paper of Kanungo et al.| KMN"'"04 | is closely related to ours: 
it analyzes the same local search algorithm we consider, and uses properties of k-means in Euclidean 
spaces to obtain a 9-approximation. Our results for hold for general metrics, and can essentially be 
viewed as extensions of their results. 

fc-Center. Tight bounds for the /c-center problem are known: there is a 2-approximation algorithm 
due to iGon85| , |HS86|| , and this is tight unless P = NP. 

UFL. For the uncapacitated metric facility location (UFL) problem, the first constant factor approxi- 
mation was given by Shmoys et al. ||STA97 |; subsequent approximation algorithms and hardness results 
have be e n given by [PTA97|, |Chu98|, ^^A02\ , |CSO^ , py?07| , [jVUl] , |CG05| , [PT03| , |MMSV01| , |JMS02| , |MYZ02 



KPROC, CG05, AGK_Oi, GK99|. It remains a tantalizing problem to close the gap between the best 



known approximation factor of 1.5 pyr07| ] , and the hardness result of 1.463 | GK9£ ]. 



1.3 Organization of the Paper 



The paper is organized as follows. In [Section 2 , we consider the metric /c-Median problem and present an 
analysis showing a 5 approximation. Following that, we analyze a version of the local-search algorithm 



for A;-Median that allows t simultaneous swaps, where we show a (3 + 2/t) approximation. In Section 3 
we consider a common generalization of the A;-median, A;-means, and fc-center problems, called the 
ip-noim fe-facility location problem, where we give an 0{p) approximation algorithm, and present an 
instance which is asymptotically tight. Finally, in [Section 4 , we present simpler proofs of two other 
known results: a 3 approximation for the uncapacitated facility location problem, and a 5 approximation 
for the /c-uncapacitated facility location problem. 

2 A Simpler Analysis of A:-Median Local Search 

In this section, we study the local search algorithm for the fe-median problem, and show that any local 
optimum has a cost which is at most 5 times the cost of the global optimum. Recall that given a set F 
of at most k facilities, the A;- median cost is 

kmed(F) = ^(i(j,F). 



The local search algorithm we consider in this section is the simplest one: we start with any set of k 
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facilities. At each point in time, we try to find some facility in our current set of facilities, and swap it 
with some currently unopened facility, so that the cost of the resulting solution decreases. It is known 
that any local minimum is a 5-approximation to the global minimum [AGK"'"04|, and that the bound 
of 5 is tight for these local-search dynamics. Here we give a simpler proof of this 5-approximation. 

2.1 A Set of Test Swaps 

To show that a local optimum is a good approximation, the standard approach is to consider a carefully 
chosen subset of potential swaps: if we are at a local optimum, each of these swaps must be non- 
improving. This gives us some information about the cost of the local optimum. To this end, consider 
the set F* of facilities chosen by an optimum solution, and let F be the facilities at the local optimum. 
Without loss of generality, assume that \F\ = \F*\ = k. 





Figure 2.1: An example mapping rj: F* ^ F and a set of test swaps S. 

Define a map rj : F* ^ F that maps each optimal facility /* to a closest facility r]{f*) £ F: that is, 
dif*,vir)) < d{f*,f) for ah f € F. Now define C F to be all the facilities that have at most 1 
facility in F* mapped to it by the map rj. (In other words, if we create a directed bipartite graph by 
drawing an arc from /* to i]{f*), R F are those facilities whose in-degree is at most 1). 

Finally, we define a set of k pairs 5 = {(r, /*)} C i? x F* such that 

• Each /* G F* appears in exactly one pair (r, /*). 

• If r/~^(r) = {/*} then r appears only once in S as the tuple (r, /*). 

• If r/~^(r) = then r appears at most in two tuples in S. 

The procedure is simple: for each r ^ R with in-degree 1, construct the pair (/, T]~^{r)) — let the optimal 
facilities that are already matched off be denoted by F^ . The other facilities in R have in-degree 
0: denote them hy Rq. A simple averaging argument shows that the unmatched optimal facilities 
\F* \F^\ < 2\Rq\. Now, arbitrarily create pairs by matching each node in Rq to at most two pairs in 
F* \ F* so that the above conditions are satisfied. 

The following fact is immediate from the construction: 

Fact 2.1 For any tuple (r, /*) e S and f* e F with f* ^ f*, 7]{f*) ^ r. 

Intuition for the Pairing. To get some intuition for why the pairing S was chosen, consider the case 
when each facility in F is the closest to a unique facility in F*, and far away from all other facilities in 
F* — in this case, opening facility /* G F* and closing the matched facility in f £ F can be handled by 
letting all clients attached to / be handled by /* (or by other facilities in F). A problem case would 
be when a facility / € F is the closest to several facilities in F*, since closing / and opening only one 
of these facilities in F* might still cause us to pay too much — hence we never consider the gains due to 
closing such "popular" facilities, and instead only consider the swaps that involve facilities from the set 
of relatively "unpopular" facilities R. 
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2.2 Bounding the Cost of a Local Optimum 



In this section, we use the fact that each of the swaps in set S constructed in Section 2.1 are non- 
improving to show that that the local optimum has small cost. 

Breaking ties arbitrarily, assume that ip : V ^ F and ip* : V ^ F* are functions mapping each client to 
some closest facility. For any client j, let Oj = d{j, F*) = d{j, be the client j's cost in the optimal 

solution, and Aj = d{j, F) = d{j, ^(j)) be it's cost in the local optimum. Let N*{f*) = {j \ ip*{j) = /*} 
be the set of clients assigned to /* in the optimal solution, and N{f) = {j \ ip{j) = /} be those assigned 
to / in the local optimum. 



Lemma 2.2 For each swap {r, f*) G S, 

kmed(F + f* - r) - kmed(F) < 



E 2 Or 

jeN{r) 



(2.1) 



Proof: Consider the following candidate assignment of clients (which gives us an upper bound on the 
cost increase): map each client in N*{f*) to /*. For each client j G N{r) \ N*{f*), consider the figure 
below. Let the facility /* = (p*{j): assign j to f = r]{f*), the closest facility in F to /*. Note that 
by Fact 2.1, f ^ r, and this is a valid new assignment. All other clients in 1/ \ {N{r) U N*{f*)) stay 
assigned as they were in ip. 




Note that for any client j S N*(f*), the change in cost is exactly Oj — Aj: summing over all these 



clients gives us the first term in the expression (2.1 



For any client j G N{r) \ N*{f*), the change in cost is 

d{j, f) - dU, r) < d{j, n + d{f\ f) - dU, r) 
<dij,n+d{f\r)-dij,r) 



<dij,n+d{j,n 



20,. 



(2.2) 
(2.3) 
(2.4) 



with ( p. 2D and (|2.4D following by the triangle inequality, and ( |2.3D using the fact that / is the closest 
vertex in F to f*. Summing up, the total change for all these clients is at most 

E 2 0,< 5] 2 0„ (2.5) 

jeN{r)\N*{r) jGNir) 

the inequality holding since we are adding in non- negative terms. This proves Lemma 2.2| . ■ 

Note that summing ( |2.lD over all tuples in S, along with the fact that each /* G F* appears exactly 
once and each r R Q F appears at most twice gives us the simple proof of the following theorem. 



Theorem 2.3 ( [1AGK+04|] ) At a local minimum F, the cost kmed(F) < 5 • kmed(F*). 
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2.3 A Projection Lemma 



Like the proof for A;-Median, most proofs in the remaining of the paper also rely on a candidate assign- 
ment of clients to give an upper bound on the cost increase. In particular, these proofs often reassign a 
client j to the image of the facility (p*{j) that serves j in the optimal solution. The following 

lemma bounds the distance between client j and the facility 'r]{ip* (j)); its proof is exactly the same as 

in e-di. 

Lemma 2.4 (Projection Lemma) For any client j E C, 

d{j,7]{^*{j))<20,+Aj. 
2.4 The A;-Median Local Search with Multiswaps 

The paper of Arya et al. [[AGK"'"04| ] also showed that local minima under t-swaps for the /c-median 
algorithm have cost at most (3 + |) times the optimum, where a 4-swap involves shutting down t of the 
current set of k facilities and opening t new facilities in their stead. (Note that the best t-swap can be 
found in time n^^^\ and hence is polynomial-time for constant t.) Using the notation from the previous 
section, we will show that kmed(-F) < (3 + |)kmed(F*), where F denotes the set of facilities at the local 
optimum, and F* the set of facilities in an optimal solution. 

2.4.1 A Set of Test t-Swaps 

In order to extend the proof of the previous section to the t-swap algorithm, we generalize the set of test 
swaps as follows. The set of test t-swaps relies on the same intuition as before, but it is more sophisticated 
for technical reasons. For each element r of F, define the degree of r to be deg(r) = |77~^(r)|. The 
pairing is constructed by the following procedure. 

Algorithm 1 Pairing construction 
i = 

while there exists r ^ F such that deg(r) > do 

(1) Ri = {r} U {any deg(r) — 1 elements of F with degree 0}. 

(2) F*=,^-\R,). 

(3) F = F\Ri, F* = F*\F*,i = i + l. 
end while 

Ri = F, F* = F*,r = i. 



When the algorithm terminates, the sequences {Ri}l^i and {F*}^^^ form partitions of F and F* , 
respectively. It is straightforward to show that (a) in all iterations, step (1) will be able to find 
deg(r) — 1 elements of degree 0, (b) \Ri\ = \F*\ for alH = 1, 2, . . . , r, and (c) all elements of Rr have 
degree 0. Moreover, the following holds. 

Fact 2.5 Ifj e Ri and ip*{j) F* , then Ri- 

2.4.2 The Cost of a Local Optimum: the t-Swap Case 

Based on the potential t-swaps, we now give upper bounds on several possible moves. The following 
proofs are largely similar to the proof of Lemma 2!^ . 
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Lemma 2.6 If \Ri\ = \F*\ < t, then 

kmed([F \ Ri] U F*) - kmed(F) < ^ [Oj - Aj) + ^ 20j (2.6) 

Proof: Consider the following assignment of clients after removing Ri and adding F*: map each client 
j G N*{F*) to ip*{j) and map each client j S N{Ri) \ N*{F*) to r]{ip*{j)). All other clients stay where 



they were. We know from [Fact 2.q that this assignment is legal, because we do not assign any clients to 
the facilities being removed. The assignment gives an upper bound on the cost of kmed([-F \ Ri] L) F*). 
Applying the projection lemma ( p^emma 2.4 ), we know that each j € N{Ri) has an increase at most 



20 j. The rest of the proof is similar to the proof of Lemma 2^ . 
Lemma 2.7 // \Ri\ = \F*\ = s > t, then 
1 



[kmed(F + r-r)-kmed(F)] < ^0,-A,)+ ^ 2 + O,-, (2.7) 



S-l ^ 

{f,r)&F*xR, j€N*(F*) jeN'iRi) 

where Ri is a set of s — 1 degree-0 elements of Ri 



Proof: Consider each individual swap in {f*,r) € F* x Ri that removes r and adds /*. By Fact 2.5 
we know that the assignment in [Lemma 2^ is legal. Thus the same analysis as in Lemma 2^ gives that 
kmed(F + f*-r)- kmed(F) < j:,eN'iniO, - A,) + E,^;vM 2 O,. 

Now consider each j G N*{F*). We know that /* appears in exactly s — 1 pairs of F* x Ri, so summing 
over these, we have J2jeN'{F*)(^j ~ ^j)- Likewise, each j £ N*{Ri) appears in exactly s pairs of 

F^ X Ri. But, < 1 + so summing over these, we have J2jeN*{Ri) ^(1 + j) Oj. Together, we have 
the lemma. ■ 

We now combine the two Lemmas. Note that for \Ri\ = \Fl\ < t, the optimality condition implies 
that kmed([F \ Ri] U F*) - kmed(F) > 0, and for ]Ri] = ]F*] > t, the optimality condition gives that 
kmed(F + f* - r) - kmed(F) > for all /* G F* and r e Ri. Therefore, by noting that {Ri}l^-^ and 
{F*}l^i are partitions of F and F*, respectively, we establish 

0< ^ fkmed([F\i?i] Ui^*) - kmed(F) j + ^ lrFT ^ kmed(F + /* - r) - kmed(F) 

i:\Ri\<t r.\Ri\>t * (/*^r)eF*x_Ri 



E (0,-A,)+ 2{l + l/t)0, 

■je7V*{F;) jeN{R,) 



i=l 

which proves the following theorem 



kmed(F*) - kmed(F) + 2(1 + l/i)kmed(F*), 



Theorem 2.8 ( [|AGK+04|] ) At a local minimum F for the t-swap version of the k-median local search 
algorithm, kmed(F) < (3 + 2/t) kmed(F*). 

3 The ^p-Norm A;-Facility Location Problem 

In this section, we consider the following common generalization of the /c-median, fc-means, and A;-center 
problems: given a metric space with a point set V and distances d{-, •), and a value p > 1, the ip-norm 
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k-facility location problem is to find a set F of k facilities to minimize the objective function 

i/p 



(3.8) 



Like the fe-median problem, this setting has a natural local-search algorithm: Starting with k facilities, 
the algorithm tries to find i £ F and j ^ F such that the "swap" F' = {F \ {i}) U {j} improves 
the objective value. The algorithm stops when no improving move exists. We will show that this 
algorithm is a bp approximation, which can be improved to a (3 + |)p approximation by allowing t- 
swaps. This algorithm is asymptotically tight as we will show in [Section 3.3 . Even for the single-swap 



case, this immediately implies a 5- approximation for /c-median, a 10-approximation for the A:-means 
problem (which can be improved to 9), and an 0(log n)-approximation for the /c-center problem. The 
implication for fc-center follows from the fact that for vectors of length n, the norm || • IIq^ is within a 
constant factor of the norm || • ||iog„, and hence maxjgy d{j, F) is within a constant factor of '&iogn(-^)- 
To the best of our knowledge, the results for p ^ 1 give the first analyses of local-search heuristics for 
these problems on general metric spaces. 

3.1 Analyzing the Single-swap Case 

We begin with the single-swap case and analyze the t-swap case in the next section. Let A{p,q) = 
(P{p, q), and assume that V is indexed by {1, 2, . . . , re}. We borrow notations and definitions from our 
analysis of A;-Median in Section 2 . To argue about the quality of our solution, we will use the set S of 



test swaps defined in Section 2.L Using an argument similar to the proof of the A;-Median problem, we 
establish: 

(r,/*)e5 (r-,/*)G5 \jGN'if) j€Nir) 

^ E - 3 E + 2 E ^(^^ '?(^* 0'))) = - + 2 E ^(J' ^(^* 0'))) (3.9) 

j£V jev j£V j£V 

We proceed to derive an upper-bound on the sum X^jgy A(j, 77(99* (j))) in terms of ^p{F*) and ^p{F) 
as follows. 

Claim 3.1 

Y,^{jMv*m < {2%{F*) + MF)y 

Proof: Define x = ((i(j, 77(99* (j))))J_^, and y = (^2d{j , ip* (j)) + d{j,ip{j))'j'^_-^. The Projection Lemma 
( [Lemma "lA ) gives that a^j < yj for all j G V, and thus < \\y\\i^^- Now note that ||((i(i, 9'*(j)))"^-^||^p 



^p{F*) and \\{d{j, ip{j)))^^-^\\£p = $p(F). Therefore, applying the triangle inequality on the (M", 
space, we have Ej^v Mj'^^iv* U))) = Mi < Mi < {2%{F*) + $p(F))P. ■ 
Together with ( [3.9[ ), this claim implies 

< $P(F*) - 3$P(F) + 2(2^>p(F*) + %iF)Y (3.10) 

To complete the proof, we let a be the smallest positive number such that ^p{F) < ctP ■ ^p{F*) and 
show that a < 5p. Suppose for a contradiction that a > bp. Consider the right-hand side of inequality 
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( |3l0D , which we rewrite as fohows. 

- 3<^P{F) + 2(2$p(F*) + %iF)Y = a^'$^(F*) (^i- - 3 + 2(1 + 2/a)f ) (3.11) 

Let f{p,a) = ^ + 2(1 + — 3. Since a^$p(-F*) is always non-negative, if we know that, for 

a > bp, f{a) < 0, we will have a contradiction to (|3.10| ). For p = 1 and 2, we can explicitly solve 
for a. For p > 2, we know that if a > bp, then ^ < 1/100 and (1 + 2/a)P < e^/^. Thus, we have 
^ + 2(1 + 2/a)P < + 2 • e^/^ — 3 < 0. The results can be summarized in the following theorem: 

Theorem 3.2 The natural local-search algorithm for the ip-facility location problem gives a bp approx- 
imation guarantee. Additionally, for p = 2, this algorithm gives a 9 approximation guarantee on general 
metric spaces. 

3.2 Analyzing the t-Swap Case 

We will now analyze the t-swaps case. The analysis here will be largely similar to the analysis of the 
single-swap case, except for few extra ingredients which we will now develop. Our analysis will use the 



set of test swaps from Section 2.4. We will borrow notations and definitions from our analysis of the 



t-swap case of fc-Median in Section 2.4 



Let us recall that F and F* denote the facilities in our solution and the facilities in the optimal solution, 
respectively; the sequences {Ri}\^i and {F*}^^^ denote partitions oi F and F* , respectively. We derive 
the following upper bounds. 

Lemma 3.3 // \Ri\ = \F*\< t, then 

<I>^([F\i?,]Ui^r)- W< E E (A(i,r?((^*(i)))-Ap (3.12) 

Proof: Consider the following assignment of clients after removing Ri and adding F*: map each client 
j £ N*{F*) to (p*{j) and map each client j G N{Ri) \ N*{F*) to r]{ip*{j)). All other clients stay where 



they were. We know from [Fact 2.5| that this assignment is legal, because we do not assign any clients to 
the facilities being removed. The assignment gives an upper bound on the cost of ^p{[F \ Ri]U F*) . This 
immediately gives ^f,{[F\R,]^F*) -<l>f,{F) < E,eiv-(F*)(0,-^P + E,e7V(R,)W*(F*)(^(j' " 
< $^([F \ Ri] U F*) - $^(F) < E,e7V.(F;)(0," " A') + E,^NiR^)iMj, vi^* U))) " ^p, which proves 
the lemma. ■ 

Lemma 3.4 // \Ri\ = \F*\ = s > t, then 



^— Yl iniF + f* - r) - ^Pp{F)] < Y (Oj-A^^) 

+ E (l + ^)(A(i,r?((^*(j)))-Ap, 



* (/*,r)GF*x/?, j€N*{F*) 



where Ri is a set of s — 1 degree-^ elements of Ri . 

Proof: Consider each individual swap in (/*, r) € F* x Ri that removes r and adds /*. Again, we will 
reassign every j S N*{f*) to /* and every j € N{r) \ N*{f*) to r, and keep all other clients where 
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they were. By [Fact 2.5| , we know that this assignment is legal, and thus ^p{F + f* — r) — < 

Now consider each j € N*{F^). We know that /* appears in exactly s — 1 pairs of x Ri, so summing 
over these, we have YljeN*(F*)(^] ~ ^j)- Likewise, each j G N*{Ri) appears in exactly s pairs of 

F* X Ri. But, ^ < 1 + J, so summing over these, we have Y.jeN'{R,)i'^ + l) iMj^viv* U))) - ^p- 
Together, we have the lemma. ■ 

Using these two lemmas and the fact that {Ri}l^i and {F*}^^^ are partitions of F and F*, we establish 
the following bound: 

E E (l + ^)(A(j,7?(c^*(j)))-^p) (3.13) 

= - (2 + i)^^(F) + (1 + ^) E A(j,r?(v,*(i))). (3.14) 



As shown in Claim 3.1 , the sum ^j^y 'nif*U))) is upper-bounded by (2$p(F*)+<I)p(F))P; therefore, 
we have 

cI>^(F*) - (2 + i)ci>^(F) + (1 + ^)(2c^,(F*) + cl.,(F))^ > (3.15) 
We are now ready to prove the following theorem: 

Theorem 3.5 For any value of t ^ Z+, the natural t-swap local-search algorithm for the £p-facility 
location problem yields the following guarantees: 

1. For p = 1, it is a (3 + |) approximation. 

2. For p = 2, it is a (5 + |) approximation. 

3. For all reals p>'l, it is a {3 + j)p approximation. 

Proof: Let a be the smallest positive number such that ^p{F) < dP ■ ^p{F*) in inequality ( 3.15| ). For 
p = 1 and p = 2, we can directly solve the inequality and obtain the desired results. For p > 2, we 
assume for a contradiction that a > (3 + |)p and proceed as follows. With the assumption, we have 

%{F*) - (2 + i)$^(F) + (1 + ^)(2$,(F*) + cI>,(F))^ 

< aP 



*!:(n(i-(2+i) + (i + i)(i + |r) 

a'*?(F.)(-l + (l + |r-2 + i((l + fr-l)) 



Now define f{p) = ( (3+2/f)p )^+ + (3+2/% )^ and g{p) = (l+ (g+f/t)^)^- It is easy to see that for p > 2, 
f{p) and g{p) are non-decreasing functions, and thus for allp > 2, f{p) < limp^oo f{p) = exp {2^^} and 
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2+t 



g{p) < limp^oo gip) = exp { 2^1^}. Note also that exp{2^} = exp{l - < e • (l 2+3* 

< aP$P(F*)(^e2*/(2+3t) _ 2 + i(e2</(2+3t) _ 



) 



e • 



2t 
2+3f 



, {2 + -)^P{F) + {l + -){2%{F*) + <^,{F)y 



<„P$P(F*)(l±l.e 



2t 



1 



2 + 3i t 



<a^^g(F*)( ^^(^ + ^) _1_2 



Simple algebra shows that ^'^^^"'"^^ 



2+3* 

) 

2+3t t 



i < 1.9; therefore, ^^p{F*) - (2 + + (l + i)(2$p(F*) + 



$p(i^))*' < - i - 2) < 0, which gives a contradiction. 



3.3 An Asymptotically Tight Example 



Inspired by a lower bound given by Kanungo et al. |KMN^04|, we present an example where the local- 
search algorithm produces an r2(p)-approximate solution for the the ^p-facility location problem. The 
example presented below is designed for the single-swap case, but it can be generalized to the t-swap 
case. 

Theorem 3.6 For every p, there are instances of the ip-norm k-facility location problem where the cost 
of some local minima (under the standard local moves) is at least 2p times the optimal cost. 

Proof: Consider a 2-dimensional torus with lattice points {0, 1, . . . , — 1}^ for some large integer N. 
The lattice points are labeled even or odd according to the parity of the sum of their coordinates. For 
a value x to be fixed later, we define a gadget D{x) to be a set of 4 points at (±x, 0), (0, ±x). 

We overlay a graph on the torus as follows. Every lattice point is a facility node. Centered at each even 
lattice point is a copy of D{x). These gadget points make up our client nodes. To set up a distance 
metric, we introduce the following edges and define the distance between any two nodes as the shortest 



path between them. As shown in Figure 3.3 , an even lattice point is at distance x from any of its 
surrounding gadget points, and an odd lattice point is at distance 1 — x from its (physical) neighboring 
gadget points. Under this distance metric, the distance between a and b, for example, is 1 -|- x. 





Figure 3.2: A lower-bound instance 
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Let X = l/(2p+ 1) and k = N'^/2. For this choice of k and x, an optimal solution, denoted by -F*, opens 
facilities at all even lattice points, yielding ^p{F*) = (4:k)^^^x. To get a 2p approximation, consider a 
solution F which opens k facilities at all odd lattice points. It is easy to see that ^p{F) = (4A;)^/*'(1 — x), 
and so ^^l^p}^ = 2p. It remains to show that F is a local optimum. Consider that, for any f*£F and 
r G F, if we shut down a facility r and open a facility /*, the change in cost is given by 

- r + /*) - ^P{F) > i(^xP - {1 - x)P^ + 3 (^(1 + x)P - (1 - x)p^ . (3.16) 

With our choice of x, it is straightforward to show that ^p{F — r + /*) — ^p{F) > 0. Thus, the solution 
F is a local optimum as desired. ■ 

4 Simpler Proofs for Other Previously Known Results 
4.1 Uncapacitated Facility Location 

In the metric uncapacitated facility location problem, instead of a hard upper bound k on the number 
of facilities, we are given an opening cost fi for each location i & V, and the goal is to minimize the 
objective function 

UFL(F) = ^/, + ^ci(j,F). 
This problem has been extremely widely studied and many constant-factor approximation algorithms 



are known: see Section 1.2 for many references. 



The Local Search Moves. Since we do not have a hard bound on the number of facilities, we can 
have a richer set of local moves — (a) opening a facility, (b) closing a currently open facility, and (c) 
swapping facilities as above. Again, we let F* be the optimal set of facilities, and F be the algorithm's 
set of facilities at a local minimum. 

4.1.1 Bounding the Local Optimum Cost for UFL 

Since we have been using / to denote a generic facility, let us use fac(/) to denote the facility opening 
cost for facility /, and fac(F') to denote the cost X^/^i?' fac(/) of a set F' of facilities. Again, for a 
client j € C, let Oj and Aj be the connection cost in the optimal and local-optimal solutions, and 
let ip* and if denote the maps assigning clients to facilities. Hence opt = fac(F*) -|- YljeC^j^ whereas 
alg = fac(F) -|- ^j^c^j- Moreover, for a facility f G F, let iV(/) denote the clients assigned to it; 
similarly, define N*(f*) to be the clients assigned to it in the optimal solution. The first lemma below 
is easy: try opening each facility in F*, note the change in cost, and add things up. 



Lemma 4.1 (Connection Cost [ KPROd| ]) At a local optimum, the fact that "open new facility" 



moves are non-improving implies the connection cost X^jgc^i — ^^^(-F*) +X]jgc^i' 



Lemma 4.2 (Facility Cost [|AGK+04|] ) The facility cost fac(F) < fac(F*) + 2Y^j^f^0j at a local 
optimum. 



Proof: Recall the notation of [Section 2| : given F* and F, let rj : F* ^ F map each optimal facility 
to a closest facility in F. Following [|AGK+04|| , call a facility / G F "good" if v~\f) = 0, and "bad" 
otherwise. 
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If a facility / is good, we can consider closing the facility and assigning any clients j G ]^{f) to 
/ = r]{ip*{j)): note that since / is good, we know that f ^ f, and hence this reassignment is valid. By 
the projection lemma ( [Lemma 2.4 ), the total increase in the assignment cost is d{j, f) — d{j, /) < 2 Oj, 
and hence from local optimality, we get that for any good f ^ F, 

-fac(/)+ 20, >0. (4.17) 

i67V(/) 

For a bad facility /, let PJ be the set ?7~^(/) = {/o , /i , . . . , ft} (with t > 0), and let /q be the closest 
one to /. We then consider the t possible moves of opening facility /* in Pj \ {/g }, and assigning any 
client j G N*{f*) n A^(/) to /*. The local optimality ensm'es that 

fac(/*)+ Yl {O,-A,)>0. (4.18) 

jeN''if*)nN{f) 

Moreover, consider the move of opening /q and closing /: 

• Any client j S N{f) with (p*{j) PJ is assigned to the facility r]{(p*{j)) 7^ /: the projection 
Lemma 2.4| implies that the increase in connection cost for such j is at most 2 Oj . 



• Any client j G N{f) with ip*{j) = f* G PJ (for some i G {0,1,..., t} is assigned to /q. The 
change in the connection cost is d{j, /g ) — (i(j, /). 

Hence, local optimality shows that 

t 

fac(/o*)-fac(/)+ Y 20, + Y {d{j,fS)-A,)>0. (4.19) 

Adding ( [4.19| ) with the t inequalities ( [4.18| ) (one for each i G {1, . . . , t}) gives us 

t 

fac(P;) - fac(/) + 20, Y id{jJ^) + O,-2A,)>0. (4.20) 

jeN{f)Av*{MPf i=o j^N*(f*)nN{f) 

Consider the rightmost sum in ( ^30| ): for i = 0, the summand is 2{Oj - Aj) <20j. For i ^ 0, 

dU, /o*) + d{j, ft) -2 d{j, f) < (dij, f) + d(/, /o*)) + d(j, f*)-2 dij, f) (4.21) 

<d{fjt)+d{jj*)-d{jj) (4.22) 
<2d{j,f:) = 20j, (4.23) 

where we used the fact that d(/, /q) < d{f,f*) in ( [4.22| ), and the triangle inequality at other places. 
Now the expression ( [4.20 ) can be simplified to say 

fac(P;) - fac(/) + EjeNif) 2 O, > 0. (4.24) 

Summing ( 4.24[ ) over all bad /, and ( [1.17[ ) over all the good /, we get 

fac(F*) - fac(F) + Yl^eC 2 O, > 0. (4.25) 

which proves the claimed bound fac(F) < fac(F*) + 2 0j. I 

Combining the facility cost and connection cost lemmas above results in the following theorem. 

Theorem 4.3 At a local optimum, UFL(F) < 2fac(F*) + 3 Ylj Oj < 3 UFL(F*). 
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4.2 A;-Uncapacitated Facility Location 



Building on the techniques developed in the previous sections, we can now give proofs for the metric 
k-uncapacitated facility location problem (A;-UFL) problem. This is a common generalization of the 
A;- median and UFL problems: not only do we have an opening cost fi for each location (like UFL), but 
we also have a limit k on the number of facilities (like /e-median). The goal is still to minimize the cost 

kUFL(F) = fac(F) + ^d(j,F), 
j&v 



where fac(F) = Yli&F fi- This problem was defined by pGK+05|] , whose main result — showing that 



local search was a 5-approximation — is reproved in this section. 

The Local Search Moves. We start with any set F of at most k facilities, and allow the following 
modes: the algorithm can open a new facility if l-F] < /c, it can close a facility in F, or swap an open 
facility in F with a currently closed facility; as usual, the algorithm performs a move only if the total 
cost decreases. 

Since we want to argue about some local optimum F, we can assume that |F| = k: indeed, if \F\ < k, 
then it is a local optimum with respect to all moves — opening, closing, or swapping facilities, and then 
the result of Section 4.1 shows that F is a 3-approximation. However, if \F\ = k, then we cannot open 



facilities even if we want to, and hence have to work harder for the proof. 
4.2.1 Pairing for k-VFL 

As with previous proofs, the central ingredient of the proof is an appropriate pairing that allows us to 
bound the cost of the local-search solution by applying the local optimality condition. To generate the 
pairing, we proceed as follows. 

• Pair degree-1 facilities f £ F with i]~^{f). The degree-1 group is called "single." 

• Pair higher degree facilities in the following manner. If deg(/) > 2, let Pj, = ?7~^(/) = 
{/o , /i , • • • , /*}, where /q is the facility closest to /, and the facilities are ordered 
arbitrarily. Additionally, let /o = / and fi, ■ ■ ■ , ft be any t distinct degree-0 facilities in F. Since 
\F\ < \F*\, we will always be able to find enough degree-0 facilities. Let Pj = {/o,/i, ■ ■ ■ ,ft}- 
The pairs {Pf^P^^) are called "heavy strips." 

• At this point, some degree-0 facilities are still left unmentioned; we call them "excess" facilities. 
4.3 Bounding the Cost of a Local Optimum 

We apply the local optimality condition to the pairs as follows. For each single pair (/,/*), we could 
swap /* for /, assigning all j € N*{f*) to /* and j G -/V(/) \ N*{f*) to r]{ip*{j)). The same reasoning 
as before shows that this is a valid reassignment. The local optimality condition, together with the 
projection Lemma 2.4| , gives 



fac(r)-fac(/)+ iO,-A,)+ ^ 20, > (4.26) 

jeN*{f) jeN{f)\N*{r) 

Now consider a heavy strip (Pf, Pj.)- Suppose P/ = {/o, ■ ■ ■ , ft} and P^. = {/q , . . . , /*}. First, we 
could swap /* for /o, assigning all j € N*{f*) to /*, ah j G iV*(/*) n iV(/o) (for i = 1, 2, . . . , t) to /*, 
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and all j € A^(/o) \ (U*^;^A^*(/*)) to This is a legal assignment as can be easily checked. By 

the local optimality condition, we have 

fac(/o*)-fac(/o)+ J] {O, - Aj) 

t 

+ E E {d{j,m-Aj)+ 20,^0 (4.27) 

i=i jeN*if*)nN{fo) iGJV(/o)\(uti^*(/*)) 

Within the same strip {Pf,Pj,), we can also exchange fi for fl (for each i = 1,2, . . . ,t): 

. If we assign all j e N*{f*) n (iV(/o) U iV(/,)) to /* and all j e iV(/;,) \ N*{f*) to 7?(v?*(i)), then 
the local optimality condition yields 

fac(/;)-fac(/i)+ Yl (0,-A,)+ Yl 20, > 0. (4.28) 

j&N*{f*)n{N{fo)UN{fO) j&NimN-'ifl) 

• If we assign all j € N*{f*) to /* and all remaining j G A^(/i) to r]{ip*{j)), then the local optimality 
condition yields 

fac(/*)-fac(/,)+ E (0.-^.)+ E 20, >0. (4.29) 

J67V(/*) jGiV(/,)\JV*{/*) 

Finally, consider deleting the "excess" facilities. For each excess facility /, we could delete it and assign 
all facilities j G iV(/) to r]{ip*{j)) — recall / has degree 0. By the local optimality, we have 

-fac(/) + Y > 0. (4.30) 



Adding up (4.26)-( 4.30| ) across all strips and all groups yields the following claim. 

Claim 4.4 For any j € C, the increase in connection cost is upper bounded by bOj — Aj. 

Proof: Consider a client j. Note that j is uniquely assigned to a facility /* = (/7*(j) in the optimal 
solution. If /* is in the degree-1 group, the increase in connection cost is clearly upper bounded by 
50j — Aj. Otherwise, /* belongs to a heavy strip ({/o, . . . , ft}, {/o , ■ ■ ■ , ft})-: ™ which case we consider 
the following possibilities: 

• If J S N*{fQ), then the increase in connection cost is upper bounded by 20j + 20j + (Oj — Aj) < 
50, -Aj. 

• If J G -/V*(/*) n A^(/o) for some i G [t], then the increase in connection cost is upper bounded 
by d{j, /o ) — Aj + iOj — Aj) + (Oj — Aj) < 30 j — Aj. The inequality follows from the fact that 
d(j,/o*) < dij,^U))+d{ipij),f^) < d{j,^{j))+d{^{j),f*) < dU,^{j))+diip{j),j)+d{j,f*) = 
2A,+0,. 

• If j G N*{f*)r\N{fi) for some i G [t], then the increase in connection cost is clearly upper bounded 
by 2{Oj -Aj)< 50 j - Aj. 

• Otherwise j must belong to N*{f^) \ (A^(/o) U N{fi)). In this case, the increase in connection cost 
is upper bounded by 20j + 20j + Oj — Aj = 50 j — Aj. 
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We conclude that for all j € C, the increase in connection cost is at most 50 j — Aj. ■ 
It follows that 2fac(F*) — fac(F) + '^Zjec^^^i ~ ^j) — resulting in the following theorem. 
Theorem 4.5 At a local minimum, kUFL(F) < 2fac{F*) + 5 J2j d{j, F*) < 5kUFL(F*). 
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